NSF PAR Search | NSF Public Access Repository

PSBD: Prediction shift uncertainty unlocks backdoor detection

Li, Wei; Chen, Pin-Yu; Liu, Sijia; Wang, Ren (July 2025, Proceedings of the Computer Vision and Pattern Recognition Conference)

Full Text Available
Rethinking Evaluation Metrics for Machine Unlearning

Shi, Yingdan; Liu, Sijia; Wang, Ren (May 2025, ICML 2025 Workshop MUGen)

Full Text Available
SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?

https://doi.org/10.18653/v1/2025.acl-long.424

Zhuang, Haomin; Zhang, Yihua; Guo, Kehan; Jia, Jinghan; Liu, Gaowen; Liu, Sijia; Zhang, Xiangliang (July 2025, Association for Computational Linguistics)

Full Text Available
When is task vector provably effective for model editing? A generalization analysis of nonlinear transformers

Li, Hongkang; Zhang, Yihua; Zhang, Shuai; Wang, Meng; Liu, Sijia; Chen, Pin-Yu (May 2025, 2025 International Conference on Learning Representations (ICLR))

Task arithmetic refers to editing the pre-trained model by adding a weighted sum of task vectors, each of which is the weight update from the pre-trained model to fine-tuned models for certain tasks. This approach recently gained attention as a computationally efficient inference method for model editing, e.g., multi-task learning, forgetting, and out-of-domain generalization capabilities. However, the theoretical understanding of why task vectors can execute various conceptual operations remains limited, due to the high non-convexity of training Transformer-based models. To the best of our knowledge, this paper provides the first theoretical characterization of the generalization guarantees of task vector methods on nonlinear Transformers. We consider a conceptual learning setting, where each task is a binary classification problem based on a discriminative pattern. We theoretically prove the effectiveness of task addition in simultaneously learning a set of irrelevant or aligned tasks, as well as the success of task negation in unlearning one task from irrelevant or contradictory tasks. Moreover, we prove the proper selection of linear coefficients for task arithmetic to achieve guaranteed generalization to out-of-domain tasks. All of our theoretical results hold for both dense-weight parameters and their low-rank approximations. Although established in a conceptual setting, our theoretical findings were validated on a practical machine unlearning task using the large language model Phi-1.5 (1.3B).
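
The task-vector operations described in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the helper names `task_vector` and `apply_task_arithmetic`, the parameter name `layer.weight`, and all numeric values are hypothetical, and weights are stored as plain Python lists rather than real model tensors.

```python
from typing import Dict, List

# A "model" here is just a mapping from parameter name to a flat list of weights.
Weights = Dict[str, List[float]]


def task_vector(pretrained: Weights, finetuned: Weights) -> Weights:
    """Task vector = fine-tuned weights minus pre-trained weights, per parameter."""
    return {
        name: [f - p for p, f in zip(pretrained[name], finetuned[name])]
        for name in pretrained
    }


def apply_task_arithmetic(
    pretrained: Weights, vectors: List[Weights], coeffs: List[float]
) -> Weights:
    """Edit the pre-trained weights by adding a weighted sum of task vectors."""
    edited = {name: list(values) for name, values in pretrained.items()}
    for vector, coeff in zip(vectors, coeffs):
        for name in edited:
            edited[name] = [w + coeff * d for w, d in zip(edited[name], vector[name])]
    return edited


# Hypothetical toy weights for one pre-trained model and two fine-tuned models.
pretrained = {"layer.weight": [1.0, 2.0, 3.0]}
finetuned_a = {"layer.weight": [1.5, 2.0, 2.0]}
finetuned_b = {"layer.weight": [1.0, 4.0, 3.0]}

tv_a = task_vector(pretrained, finetuned_a)
tv_b = task_vector(pretrained, finetuned_b)

# Task addition: positive coefficients combine both tasks.
added = apply_task_arithmetic(pretrained, [tv_a, tv_b], [1.0, 1.0])

# Task negation: a negative coefficient moves away from task A ("unlearning").
negated = apply_task_arithmetic(pretrained, [tv_a], [-1.0])

print(added)
print(negated)
```

The coefficients are fixed to 1.0 and -1.0 only for illustration; the abstract states that the paper analyzes how these linear coefficients should be selected.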
Full Text Available
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers

Li, Hongkang; Zhang, Yihua; Zhang, Shuai; Chen, Pin-Yu; Liu, Sijia; Wang, Meng (April 2025, The Thirteenth International Conference on Learning Representations (ICLR))

Full Text Available
From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models

Pan, Zhuoshi; Yao, Yuguang; Liu, Gaowen; Shen, Bingquan; Zhao, H Vicky; Kompella, Ramana Rao; Liu, Sijia (December 2024, NeurIPS)

Full Text Available
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models

Zhang, Yimeng; Chen, Xin; Jia, Jinghan; Zhang, Yihua; Fan, Chongyu; Liu, Jiancheng; Hong, Mingyi; Ding, Ke; Liu, Sijia (December 2024, NeurIPS)

Full Text Available
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy to Generate Unsafe Images ... For Now

https://doi.org/10.1007/978-3-031-72998-0_22

Zhang, Yimeng; Jia, Jinghan; Chen, Xin; Chen, Aochuan; Zhang, Yihua; Liu, Jiancheng; Ding, Ke; Liu, Sijia (September 2024, Springer Nature Switzerland)

Full Text Available
Rethinking machine unlearning for large language models

https://doi.org/10.1038/s42256-025-00985-0

Liu, Sijia; Yao, Yuanshun; Jia, Jinghan; Casper, Stephen; Baracaldo, Nathalie; Hase, Peter; Yao, Yuguang; Liu, Chris Yuhao; Xu, Xiaojun; Li, Hang; et al (February 2025, Nature Machine Intelligence)

Full Text Available
Secondary Organic Aerosol from OH-Initiated Oxidation of Mixtures of d-Limonene and β-Myrcene

https://doi.org/10.1021/acs.est.4c04870

Liu, Sijia; Galeazzo, Tommaso; Valorso, Richard; Shiraiwa, Manabu; Faiola, Celia L; Nizkorodov, Sergey A (July 2024, Environmental Science & Technology)

The chemical composition and physical properties of secondary organic aerosol (SOA) generated through OH-initiated oxidation of mixtures containing β-myrcene, an acyclic monoterpene, and d-limonene, a cyclic monoterpene, were investigated to assess the extent of chemical interactions between their oxidation products. The SOA samples were prepared in an environmental smog chamber, and their composition was analyzed offline using ultra-performance liquid chromatography coupled with electrospray ionization high-resolution mass spectrometry (UPLC-ESI-HRMS). Our results suggested that SOA containing β-myrcene showed a higher proportion of oligomeric compounds with low volatility compared to SOA from d-limonene. The formula distribution and signal intensities of the mixed SOA could be accurately predicted by a linear combination of the mass spectra of SOA from individual precursors. Effects of cross-reactions were observed in the distribution of isomeric oxidation products within the mixed SOA, as evidenced by chromatographic analysis. On the whole, β-myrcene and d-limonene appear to undergo oxidation by OH largely independently from each other, with only subtle effects from cross-reactions influencing the yields of specific oxidation products.
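
The "linear combination of the mass spectra" mentioned in this abstract can be pictured with a small sketch. The abstract does not say how the combination was fitted, so ordinary least squares is an assumption here, the function name `fit_two_component_mix` is hypothetical, and the three-peak spectra are toy values, not measured data.

```python
from typing import List, Tuple


def fit_two_component_mix(
    x: List[float], y: List[float], mixed: List[float]
) -> Tuple[float, float]:
    """Least-squares coefficients (a, b) such that mixed ~ a*x + b*y.

    Solves the 2x2 normal equations directly; assumes x and y are not collinear.
    """
    xx = sum(v * v for v in x)
    yy = sum(v * v for v in y)
    xy = sum(u * v for u, v in zip(x, y))
    xm = sum(u * m for u, m in zip(x, mixed))
    ym = sum(v * m for v, m in zip(y, mixed))
    det = xx * yy - xy * xy
    if det == 0:
        raise ValueError("component spectra are collinear; coefficients are not unique")
    a = (xm * yy - xy * ym) / det
    b = (xx * ym - xy * xm) / det
    return a, b


# Toy intensities on a shared set of three peaks (hypothetical values).
spectrum_limonene = [1.0, 0.0, 2.0]
spectrum_myrcene = [0.0, 1.0, 1.0]
spectrum_mixed = [0.25, 0.75, 1.25]

a, b = fit_two_component_mix(spectrum_limonene, spectrum_myrcene, spectrum_mixed)

# Residual between the measured mixture and the fitted linear combination.
residual = [
    m - (a * u + b * v)
    for u, v, m in zip(spectrum_limonene, spectrum_myrcene, spectrum_mixed)
]
print(a, b, residual)
```

A residual near zero on every peak corresponds to the case the abstract describes, where the mixed spectrum is well predicted by the individual precursors' spectra.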
Full Text Available

